Efficient Multiclass Classification Using Feature Selection in High-Dimensional Datasets

نویسندگان

چکیده

Feature selection has become essential in classification problems with numerous features. This process involves removing redundant, noisy, and negatively impacting features from the dataset to enhance classifier’s performance. Some are less useful than others or do not correlate system’s evaluation, their removal does affect In most cases, a monotonically decreasing impact on performance increases accuracy. Therefore, this research aims propose dimensionality reduction method using feature technique paper proposes novel feature-selection approach that combines filter wrapper techniques select optimal Mutual Information Sequential Forward Method 10-fold cross-validation. Results show proposed algorithm can reduce by more 75% datasets large achieve maximum accuracy of 97%. The outperforms performs similarly existing ones. could be better option for minimized

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature Selection in High-Dimensional Classification

High-dimensional discriminant analysis is of fundamental importance in multivariate statistics. Existing theoretical results sharply characterize different procedures, providing sharp convergence results for the classification risk, as well as the l2 convergence results to the discriminative rule. However, sharp theoretical results for the problem of variable selection have not been established...

متن کامل

Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets

Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...

متن کامل

An iterative SVM approach to feature selection and classification in high-dimensional datasets

The support vector machine (SVM) is the state-of-the-art classification method, and the doubly regularized SVM (DrSVM) is its important extension based on the elastic net penalty. DrSVM has been successfully applied in handling variable selection and while retaining (or discarding) correlated variables. However, it is challenging to solve this model. In this paper we develop an iterative l2-SVM...

متن کامل

Improved PSO for Feature Selection on High-Dimensional Datasets

Classification on high-dimensional (i.e. thousands of dimensions) data typically requires feature selection (FS) as a pre-processing step to reduce the dimensionality. However, FS is a challenging task even on datasets with hundreds of features. This paper proposes a new particle swarm optimisation (PSO) based FS approach to classification problems with thousands or tens of thousands of feature...

متن کامل

Feature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach

Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12102290